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I. Basis of the report 



1. With regard to the elements of the international application:* 
| | the international application as originally filed 
the description: 

(See Attached) 



pages 
pages 
pages 



_ , as originally filed 
filed with the demand 



filed with the letter of 



| x| tne claims: 



pages . 
pages 
pages 
pages 



(See Attached) 



, as originally filed 

as amended (together with any statement) under Article 19 

, filed with the demand 



, filed with the letter of 



| x| tne drawings: 



pages 
pages . 
pages . 



(See Attached) 



: , as originally filed 

, filed with the demand 



the sequence listing part of the description: 

pages (See Attached) 

pages 

pages _ 



filed with the letter of . 



, as originally filed 

, filed with the demand 



filed with the letter of . 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international application was filed, unless otherwise indicated under this item. , . 
These elements were available or furnished to this Authority in the following language . which is: 

| [ the language of a translation furnished for the purposes of international search (under Rule 23.1(b)). 

| | the language of publication of the international application (under Rule 48.3(b)). 

I I the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 and/ 
or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international 
preliminary examination was carried out on the basis of the sequence listing: 

□ contained in the international application in printed form. 

| | filed together with the international application in computer readable form. 

| | furnished subsequently to this Authority in written form. 

| | furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

I I The statement that the information recorded in computer readable form is identical to the writen sequence listing has 
' — ' been furnished. 

^ | x| The amendments have resulted in the cancellation of: 

NO 



the description, pages_ 
the claims, Nos. 



0 

fx] the drawings, sheets/fig NO 



5. [5T] This report has been drawn as if (some of) the amendments had not been made, since they have been considered to go 

beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)).** 
* Replacement sheets \x4iich fiave been furnished to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally filed and are not annexed to this report since they do not contain amendments (Rules 70.16 
and 70.17). 

**Any replacement sheet containing such amendments must be referred to under item 1 and anne xed to this report. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



1. statement 



Novelty (N) Claims 1^2 YES 

Claims NONE NO 

Inventive Step (IS) Claims 2-21 YES 

Claims 1, 22 NO 

Industrial Applicability (IA) Claims 1-22 _ YES 

Claims NONE NO 



2. citations and explanations (Rule 70.7) 

Claims 1 and 22 lack an inventive step under PCT Article 33(3) as being obvious over Bigues (U.S. 5,787,425) in view of 
Kauffman et al. (U.S. 5,787,413). 

Bigus discloses a system for performing data mining applications comprising a computer having one or more data storage 
devices connected thereto, wherein a relational database is stored on one or more of the data storage devices and a relational 
database management system, executed by the computer, for accessing the relational database stored on the data storage 
devices (column 12, lines 19-44). Bigus does not disclose an analytic application programming interface (API) that generates 
a set of scalable data mining functions. Kauffman discloses this limitation (column 10, lines 37-49). Bigus also discloses the 
claimed set of scalable data mining functions including queries for execution by the relational database management system 
(column 12, lines 23-34). It would have been obvious to one of ordinary skill in the art at the time of the invention to modify 
Bigus to use an API to generate a set of scalable data mining functions by incorporating the teachings of Kauffman to 
increate the efficiency of data mining applications. 

NEW CITATIONS 

NONE 
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Supplemental Box 

(To be used when the space in any of the preceding boxes is not sufficient) 



I. BASIS OF REPORT: 

This report has been drawn on the basis of the description, 
page(s) 1-18, as originally filed. 
page(s) NONE, filed with the demand, 
and additional amendments: 
NONE 

This report has been drawn on the basis of the claims, 

page(s) NONE, as originally filed. 

page(s) NONE, as amended under Article 19. 

page(s) NONE, filed with the demand. 

and additional amendments: 

Pages 19-23, filed with the letter of 02 February 2001. 

This report has been drawn on the basis of the drawings, 
page(s) 1-5, as originally filed. 
page(s) NO, filed with the demand, 
and additional amendments: 
NO 

This report has been drawn on the basis of the sequence listing part of the description: 

page(s) NONE, as originally filed. 

pages(s) NONE, filed with the demand. 

and additional amendments: 

NONE 

5. (Some) amendments are considered to go beyond the disclosure as filed: 



Continuation of: Boxes I - VIII 



Sheet 10 



NO 
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WHAT IS CLAIMED IS: 

1 . A computer-implemented system for performing data mining 
applications, comprising: 

(a) a computer having one or more data storage devices connected thereto, 
wherein a relational database is stored on one or more of the data storage devices; 

(b) a relational database management system, executed by the computer, for 
accessing the relational database stored on the data storage devices; and 

(c) an analytic application programming interface (API) that generates a set of 
scalable data mining functions including queries for execution by the relational database 
management system, executed by the computer, for performing data mining operations 
direcdy within the database management system. 

2. The system of claim 1 above, wherein the computer comprises a parallel 
processing computer comprised of a plurality of nodes, and each node executes one or 
more threads of the relational database management system to provide parallelism in the 
data mining operations. 

3. The system of claim 1, wherein the scalable data mining functions 
process data collections stored in the relational database and produce results that are 
stored in the relational database. 

4. The system of claim 1, wherein the scalable data rriining functions are 
created by parameterizing and instantiating the analytic API. 

5. The system of claim 1, wherein the scalable data mining functions are 
dynamically generated queries comprised of combined phrases with substituting values 
therein based on parameters supplied to the analytic API. 

6. The system of claim 5, wherein the scalable data mining functions are 
selected from a group of functions comprising Data Description functions, Data 
Derivation functions, Data Reduction functions, Data Reorganization functions, Data 
Sampling functions, and Data Partitioning functions. 

7. The system of claim 6, wherein the Data Description functions comprise 
descriptive statistical functions. 
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8. The system of claim 6, wherein the Data Description functions are 
selected from a group comprising: 

(1) descriptive statistics for one or more numeric columns, wherein the 
5 statistics are selected from a group comprising count, minimum, 

maximum, mean, standard deviation, standard mean error, variance, 
coefficient of variance, skewness, kurtosis, uncorrected sum of squares, 
corrected sum of squares, and quantiles, 

(2) a count of values for a column, 

10 (3) a calculated modality for a column, 

(4) one or more bin numeric columns of counts with overlay and statistics 
options, 

(5) one or more automatically sub-binned numeric columns giving additional 
counts and isolated frequendy occurring individual values 

15 (6) a computed frequency of one or more column values, 

(7) a computed frequency of values for pairs of columns in a column list, 

(8) a Pearson Product-Moment Correlation matrix, 

(9) a Covariance matrix, 

(1 0) a sum of squares and cross-products matrix, and 

20 (1 1) a count of overlapping column values in one or more combinations of 

tables. 

9. The system of claim 6, wherein the Data Derivation functions provide 
column derivations or transformations. 

25 

10. The system of claim 6, wherein the Data Description functions are 
selected from a group comprising: 

(1) a derived binned numeric column wherein a new column is bin number, 

(2) a n-valued categorical column dummy-coded into "n" 0/1 values, 
30 (3) a n-valued categorical column recoded into n or less new values, 

(4) one or more numeric columns scaled via range transformation, 

(5) one or more columns scaled to a z-score that is a number of standard 
deviations from a mean, 

(6) one or more numeric columns scaled via a sigmoidal transformation 
35 function, 
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(7) one or more numeric columns scaled via a base 10 logarithm function, 

(8) one or more numeric columns scaled via a natural logarithm function, 

(9) one or more numeric columns scaled via an exponential function, 

(10) one or more numeric columns raised to a specified power, 

5 (11) one or more numeric columns derived via user defined transformation 

function, 

(12) one or more new columns derived by ranking one or more columns or 
expressions based on order, 

(13) one or more new columns derived with quantile 0 to n-1 based on order 
10 and n, 

(14) a cumulative sum of a value expression based on a sort expression, 

(15) a moving average of a value expression based on a width and order, 

(16) a moving sum of a value expression based on a width and order, 

(1 7) a moving difference of a value expression based on a width and order, 
15 (1 8) a moving linear regression value derived from an expression, width, and 

order, 

(1 9) a multiple account/ product ownership bitmap, 

(20) a product ownership bitmap over multiple time periods, 

(21) one or more counts, amount, percentage means and intensities derived 
20 from a transaction summary, 

(22) one or more variabilities derived from transaction summary data, 

(23) one or more derived trigonometric values and their inverses, including 
sin, arcsin, cos, arccos, esc, arccsc, sec, arcsec, tan, arc tan, cot, and arccot, 
and 

25 (24) one or more derived hyperbolic values and their inverses, including sinh, 

arcsinh, cosh, arccosh, csch, arccsch, sech, arcsech, tanh, arctanh, coth, 
and arccoth. 

1 1 . The system of claim 6, wherein the Data Reduction functions provide 

30 matrix building operations to reduce the amount of data required for analytic algorithms. 

12. The system of claim 6, wherein the Data Reduction functions are selected 
from a group comprising: 
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(1) build one or more data reduction matrices from a group comprising: (i) a 
Pearson-Product Moment Correlations matrix; (ii) a Covariances matrix; 
and (iii) a Sum of Squares and Cross Products (SSCP) matrix, 

(2) export a resultant matrix, and 

(3) restart a matrix operation. 

13. The system of claim 6, wherein the Data Reorganization functions 
provide an ability to reorganize data by joining or de-normalizing pre-processed results 
into a wide analytic data set. 



14. The system of claim 6, wherein the Data Reorganization functions are 
selected from a group comprising: 

(1) create a de-normalized new table by removing one or more key columns, 
and 

15 (2) join a plurality of tables or views into a combined result table. 

15. The system of claim 6, wherein the Data Sampling function provides an 
ability to construct a new table containing a randomly selected subset of the rows in an 
existing table or view. 

20 

16. The system of claim 6, wherein the Data Sample function selects one or 
more data samples of specified sizes from a table. 

17. The system of claim 6, wherein the Data Partitioning function provides 
25 an ability to construct a new table containing at least one randomly selected subset of the 

rows in an existing table or view, wherein the subsets are mutually distinct but all- 
inclusive subsets of data. 

18. The system of claim 6, wherein the Data Partitioning function selects one 
30 or more data partitions from a table using a database internal hashing technique. 

19. The system of claim 1, wherein results of the data mining operations are 
stored in the relational databases. 
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20. The system of claim 1 , wherein the relational database management 
system further comprises an analytical logical data model that stores metadata and 
processing results from the Scalable Data Mining Functions. 

21. A method for performing data mining applications, comprising: 

(a) storing a relational database on one or more data storage devices connected to 
a computer; 

(b) accessing the relational database stored on the data storage devices using a 
relational database management system; and 

(c) utilizing a comprehensive set of parameterized analytic capabilities for 
performing data mining operations directly within a massively parallel relational database 
management system, the set of parameterized analytic capabilities including queries for 
execution by the relational database management system. 

22. An article of manufacture comprising logic embodying a method for 
performing data mining applications, comprising: 

(a) storing a relational database on one or more data storage devices connected to 
a computer; 

(b) accessing the relational database stored on the data storage devices using a 
relational database management system; and 

(c) utilizing a comprehensive set of parameterized analytic capabilities for 
performing data mining operations direcdy within a massively parallel relational database 
management system, the set of parameterized analytic capabilities including queries for 
execution by the relational database management system. 



